# Conformer architecture
Asr Conformer Largescaleasr
Apache-2.0
This is an end-to-end automatic speech recognition system trained using the SpeechBrain framework, employing the Conformer architecture on 25,000 hours of English speech data.
Speech Recognition English
speechbrain
92
12
Indicconformer Stt Ne Hybrid Ctc Rnnt Large
MIT
IndicConformer is a Conformer-based automatic speech recognition (ASR) model with a hybrid CTC-RNNT architecture, optimized for Nepali speech transcription.
Speech Recognition Other
ai4bharat
36
2
Indicconformer Stt Hi Hybrid Ctc Rnnt Large
MIT
IndicConformer is a Conformer-based automatic speech recognition (ASR) model with a hybrid CTC-RNNT architecture, supporting Hindi speech transcription.
Speech Recognition Other
ai4bharat
1,694
3
W2v Bert 2.0
MIT
A speech encoder based on the Conformer architecture, pretrained on 4.5 million hours of unlabeled audio data covering more than 143 languages.
Speech Recognition
Transformers Supports Multiple Languages

facebook
477.05k
170
Fastspeech2 Conformer With Hifigan
Apache-2.0
A text-to-speech model that pairs FastSpeech2Conformer with a HiFi-GAN vocoder for efficient, high-quality speech synthesis.
Speech Synthesis
Transformers English

espnet
635
0
Fastspeech2 Conformer
Apache-2.0
FastSpeech2Conformer is a non-autoregressive text-to-speech (TTS) model that combines the advantages of FastSpeech2 and the Conformer architecture, enabling fast and efficient generation of high-quality speech from text.
Speech Synthesis
Transformers English

espnet
2,440
6
Stt Rw Conformer Transducer Large
This is a large Conformer-Transducer model for Kinyarwanda speech recognition. It transcribes speech into lowercase Latin letters, along with spaces and apostrophes.
Speech Recognition Other
nvidia
116
1
Stt Zh Conformer Transducer Large
This is a large Conformer-Transducer model for transcribing Mandarin speech, with approximately 120 million parameters, trained on the AISHELL-2 dataset.
Speech Recognition Chinese
nvidia
72
13
Stt En Conformer Transducer Xlarge
This is an Automatic Speech Recognition (ASR) model developed by NVIDIA, based on the Conformer-Transducer architecture, with approximately 600 million parameters, specifically designed for English speech transcription.
Speech Recognition English
nvidia
496
54
Stt Kr Conformer Transducer Large
This is a large Korean automatic speech recognition model based on the Conformer-Transducer architecture, trained on the KsponSpeech dataset.
Speech Recognition Other
eesungkim
129
9
Wav2vec2 Conformer Rope Large 100h Ft
Apache-2.0
A Wav2Vec2 Conformer model with rotary position embeddings, fine-tuned on 100 hours of LibriSpeech data.
Speech Recognition
Transformers English

facebook
99
0
Simpleoier Librispeech Asr Train Asr Conformer7 Wavlm Large Raw En Bpe5000 Sp
An automatic speech recognition (ASR) model trained with the ESPnet framework on the LibriSpeech dataset, using the Conformer architecture and the pretrained WavLM Large model.
Speech Recognition English
espnet
66
1
Kan Bayashi Vctk Xvector Conformer Fastspeech2
A multi-speaker text-to-speech model trained with the ESPnet framework on the VCTK dataset.
Speech Synthesis English
espnet
15
0